The Speech service is the unification of speech-to-text, text-to-speech, and speech-translation into a single Azure subscription. It's speech capabilities enable applications, tools, and devices with the Speech CLI, Speech SDK, Speech Devices SDK, Speech Studio, or REST APIs.
Services include:
Speech to Text - Transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, and capture key discussions in meetings.
Text to Speech - Create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages and variants. Create natural-sounding audio content, improve accessibility with read-aloud functionality, and create custom voice assistants.
Speech Translation - Translate audio from more than 30 languages and customize translations for organization's specific terms in a preferred programming language.
Speaker Recognition - Confirm a person's identity or recognize who's speaking in a meeting by adding speaker verification and identification to an app.
Custom Commands - Users can build a touchless, voice-first experience to improve safety and support back-to-work scenarios.
Custom Keywords - Custom keyword for IoT devices and voice-enabled assistants to set your brand apart—making it more personal, personable, and secure.